Method and apparatus for providing motion control signals between a fixed camera and a ptz camera

ABSTRACT

The method and apparatus for providing motion control signals between a fixed lens static camera and a PTZ camera that includes an adjustable-in-use focal length adapted to provide improved surveillance. In one aspect of the present invention, the improved method and apparatus includes an improved PTZ process in which fuzzy logic based information is utilized to achieve a set of more reliable P/T/Z parameters. In another aspect of the invention, the fixed lens static camera and the PTZ camera are mounted vertically on top of each other. In another aspect of the present invention, there is provided an automatic self-calibration between the fixed lens static camera and the PTZ camera. In another aspect of the present invention, there is provided a cost-effective standalone single board DSP solution.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates to a method and apparatus for providing motion control signals between a fixed camera and a PTZ camera, and, more particularly providing motion control signals between a fixed camera and a moving PTZ camera for detecting a moving object and taking a picture of the moving object.

2. Background of the Invention

It is desirous for surveillance systems to monitor a wide area, as well as capture detailed information about any suspicious target within that wide area. Practically, however, this is very difficult to achieve, since the goal of wide area monitoring and high resolution target image acquisition are opposite. This is since a wide area monitoring system needs a camera lens with a large field of view, and thus a short focal length, but to identify an object at a distance, a telephoto lens that has a small field of view and a large focal length is needed. FIG. 1A illustrates a picture taken with a camera having a wide angle lens with a short focal length, and FIG. 1B illustrates a picture of an object with the field of view of FIG. 1A taken with a camera having a telephoto lens to illustrate these opposing considerations.

In addition to using a telephoto lens, if substituting the telephoto lens, which has a fixed focal length, for a zoom lens, which has a range of larger focal lengths, the technical complexity increases even further.

It is known, however, to use a side-by-side combination of cameras together within a surveillance system, such as a fixed view wide angle camera mounted next to a zoom camera For example, the article entitled “A Master Slave System to Acquire Biometric Imagery of Humans at a Distance” by Xuhui Zhou et al; IWVS, 2003; Nov. 7, 2003 describes a system that includes a master camera having a wide angle lens, a slave camera having a fixed-in-use zoomed lens next to it, and a pan-tile-zoom (PTZ) process that is used to control movement of the slave PTZ camera based upon information received from the master camera and the slave camera. While this system describes basic elements that are known in such surveillance systems, the PTZ process as described has disadvantages.

One disadvantage with known dual rig cameras, such as the one described above, is that manual calibration between the different cameras is required. In this manual calibration, first a series of pixel locations are picked up. Then for each of those pixels, the PT camera is moved manually to center at that pixel and the P/T values are recorded. After manually calibration all those preselected points, interpolation was used to get a denser map. This process is tedious and requires significant human effort. And this process is performed every time a different lens and every time a different configuration is used. Further, the system requires regular calibration after the deployment due to slight physical shifts between the two cameras.

While the above methods of use and configurations of dual rig cameras are useful, improvements to make them more durable, efficient and cost-effective are needed.

SUMMARY OF THE INVENTION

The method and apparatus for providing motion control signals between a fixed lens static camera and a PTZ camera that includes an adjustable-in-use focal length adapted to provide improved surveillance.

In one aspect of the present invention, the improved method and apparatus includes an improved PTZ process in which fuzzy logic based information is utilized to achieve a set of more reliable P/T/Z parameters.

In another aspect of the invention, the fixed lens static camera and the PTZ camera are mounted vertically on top of each other.

In another aspect of the present invention, there is provided an automatic self-calibration between the fixed lens static camera and the PTZ camera.

In another aspect of the present invention, there is provided a cost-effective standalone single board DSP solution.

BRIEF DESCRIPTION OF THE DRAWINGS

The above and other aspects of the present invention will become readily apparent when reading the following detailed description taken in conjunction with the appended drawings in which:

FIG. 1A illustrates a picture taken with a camera having a wide angle lens with a short focal length;

FIG. 1B illustrates a picture of an object with the field of view of FIG. 1A taken with a camera having a telephoto lens;

FIG. 2A illustrates a block diagram of the dual rig camera system according to a preferred embodiment of the present invention; and

FIG. 2B illustrates a flow diagram of the dual rig camera system according to a preferred embodiment of the present invention.

FIGS. 3A and 3B illustrate object detection through the static camera and the PTZ camera according to the present invention.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

The present invention provides a method and apparatus for providing motion control signals between a fixed camera and a moving camera, and, more particularly providing motion control signals between a fixed static camera and a PTZ camera.

The dual-camera rig 200 according to the present invention is illustrated in FIG. 2A. As shown the two cameras 220 and 250 are mounted on a common platform 210. The camera 220 on top is a fixed lens camera (hereafter referred to as static camera 220 ), and the camera 250 on the bottom is a PTZ camera (hereafter referred to as PTZ camera 250 ). This top-bottom configuration preferred to a side-to side configuration for the following reasons:

a. The two cameras 220 and 250 can be mounted very close to each other, preferably such that the lenses of each are within 15 cm of each other, which makes the calibration and control much simpler.

b. There is no occlusion between the two cameras 220 and 250. Videos from both Static camera 220 and PTZ camera 250 are fed into a single DSP board 270, preferably a TI DSP board, which accommodates all the video analysis. The PTZ camera 250 is controlled via the RS485 port of the DSP board 270 and results are sent out through an Ethernet link 280.

On the DSP board 270, subsystems for the static camera 220 and the PTZ camera both run. The detection and tracking algorithms are running constantly for the static camera 220, whereas The detection and tracking algorithms for the PTZ camera 250 are activated if the static camera detects something and commands are then sent to the PTZ camera, as is illustrated in the flowchart of FIG. 2B and described further hereinafter.

As illustrated in the flowchart, and then certain of the steps explained in more detail hereinafter, in the step 110 after starting a determination is made whether automatic self-calibration is needed. If so, step 115 follows, and that occurs, as explained hereinafter. Thereafter, or if no automatic self-calibration is needed, step 120 follows, and the PTZ camera 250 is reset. Thereafter, in step 125, target detection occurs, using the static camera 220, with the static camera module. If detection of a suspicious target or targets occurs, then step 135 follows and the suspicious target is tracked in step 135, as detailed further hereinafter. If there is not a suspicious target detected, then the sequence returns to step 120. If more than one target is detected, the targets are tracking in parallel, and a different tracker is assigned to each detected target and the tracking will continue until terminated. The description hereinafter will assume the tracking of a single target.

From the target being tracked, control signals that provide the target size and location information are transmitted to the control algorithm, which then determines the PTZ parameters for the PTZ camera 250 in step 140, as described further hereinafter. Steps 135 and 140 continue until a signal is received (from step 175 described later) that the object tracking using the static camera 220 should stop.

Also following step 140, in step 150, the PTZ camera movement control uses the PTZ parameters (including specific focal length within the adjustable focal length range of the PTZ camera) to move the PTZ camera and adjust the focal length in order to detect the object, and be able to take a recognizable picture of the object. Thus, in step 155, with the object detected by the PTZ camera 250, a picture can be taken. Thereafter, in step 160, active tracking (as also described hereinafter) can be used, using the image information obtained from the PTZ camera 250. If a decision is made to continue tracking in step 165, then the image information from the PTZ camera 250 is fused with image information from the static camera 220 in step 145, as described further hereinafter, and used to continue the active tracking of the object (though active tracking can take place using only image information from one of the cameras). If tracking does not continue, then in step 170 the process is terminated as to that object and the PTZ subsystem, and then a decision is made in step 175 whether to terminate the tracking as it relates to the static camera 220 and the static camera subsystem. Step 175 is thus used to control how the detected objects from the static camera 220 are processed one by one by the static camera 220 and the PTZ camera 250.

Once a moving object is detected within the field of view of the static camera 220 as shown by step 135 (also see FIG. 3.a), the location and size of the detected object are passed to the controlling algorithm, which optimizes the pan, tilt and zoom parameters based on the current status of the PTZ camera 250 and the correlation map. The controlling algorithm can be implemented in different ways, but controlling algorithm can be viewed as essentially a table look-up. Having already calibrated the dual-camera unit, and having the correlation map, for a new location, the controlling algorithm searches the correlation map (look-up table), based upon size of the object and location, and obtains the proper P/T/Z values associated with the that size and location, to send to the PTZ camera 250. Under some cases, bilinear interpolation is utilized if there is no exact match in the look-up table.

The optimized P/T/Z parameters are then sent to the PTZ camera 250, as described previously in step 140. And this set of P/T/Z parameters puts the object, which may be moving, at the center of the images captured from the PTZ camera 250 with the designated size (see FIG. 3.b.) The PTZ camera 250 will actively follow the object till the predefined condition is satisfied, as shown by steps 160 described previously.

The above-described overall operation sequence has a number of aspects which will now be described in more detail, describing further certain novel features of the present invention.

Automatic Self-Calibration

Another feature distinguishing the present invention is the automatic self-calibration capability, illustrated as step 115 in FIG. 2B, which contrasts with the manual calibration of existing camera rigs. In contrast to the manual self-calibration known in the art, the automatic self calibration is carried out automatically, which will be detailed hereinafter. The automatic self-calibration operates as follows.

Depending on the specific requirements, the system 200 can be configured work under two moods: fixed-zoom mood and variable-zoom mood. Fixed-zoom mood is applicable where both the environment settings and the size of the object of interested are known. For instance, if the system 200 will monitor a building entrance and will be mounted on a pole and pedestrians are the only object detected, the focal length of the PTZ camera 250 based on the distances as well as other requirements can be calculated, and a fixed focal length used.

Fixed-zoom mood is only a special case of the variable-zoom mood, where the focal length is adjustable depending on the objects detected through the static camera 220. If an object shows up at a location far away from the static camera 220, the size of the object will be small. To obtain a clear shot, a large zoom value will be necessary. On the other hand, if an object shows up at a location closer to the static camera 220, a smaller zoom value will be needed.

First, the automatic self-calibration algorithm is explained for the system 200 working under fixed-zoom mood.

1. Extract N points from the static camera 220 image using Eigenvalue-based corner detection method or any other robust corner detection method.

2. Pick up a point ps_(i) from the list.

3. Set the focal length of the PTZ camera 250 same as the focal length of the static camera 220 (focal length of the static camera 220 is known from the manufacture) and pan/tilt parameters to the zero location. The zero location of the PTZ camera 250 is the location where the optical axis of the static camera 220 and PTZ camera 250 are parallel to each other.

4. Find the corresponding point pp_(i) from the PTZ camera 250 using the correlation map, which is known to us from the configuration.

5. Compute the pan and tilt values Theoretical (P_(i), T_(i), Z) assuming we need to move the pp_(i) to the center of the PTZ camera 250 image at zoom value Z based on the relation between different zoom values and the field of view.

6. Track point pp through the PTZ camera 250 using feature point tracking algorithm. Based on the feed-back from the tracking, move the PTZ camera 250 so that pp is presented at the center of the images from PTZ camera 250. Record the P/T/Z parameters Empirical (P_(i), T_(i), Z) where Z is a fixed value. The Empirical (P_(i), T_(i), Z) reflects the Static camera 220 and PTZ camera 250 correlation between point ps_(i) from the static camera 220 and center point from PTZ camera 250.

7. Do the average: C_(i)(P_(i), T_(i), Z)=0.5×Theoretical (P_(i), T_(i), Z)+0.5×Empirical (P_(i), T_(i), Z).

8. Increment i to i=i+1, and go back to Step 2 if i<=N.

9. Obtain a denser static camera-PTZ camera correlation map M from the C_(i)(P_(i), T_(i), Z), for i=1 . . . N using interpolation.

The above process calculates the static camera-PTZ camera correlation twice for each point ps_(i): One is pure theoretical calculation as described in Step 5, and the other is empirical results based on tracking (see Step 6). The final mapping is preferably based on some type of average of these two sets of data.

To calibrate the system 200 for the purpose of working under variable-zoom mood, the following procedure should be followed:

1. Sample the zoom range [Z_(min) Z_(max)] to get N different zoom values {Z₁, Z₂, . . . Z_(N)}

2. for i=1, 2, . . . , N, calibrate the rig by following the same procedure as the calibration for fixed-zoom mood by setting Z=Z_(i) to get the correlation map M_(i).

3. Find the target size mapping SM between the static camera and PTZ camera.

During an operation, after a moving target has been detected through the static camera 220, its size will preferably be retrieved first. Based on the size of the target at the view from the static camera 220 and the designated size at the PTZ camera 250, a zoom parameter Z_(i) can be obtained by consulting a size map SM, which is part of the correlation map referred to above Once Z_(i), is known, the correlation map M_(i) will be utilized to direct PTZ camera in the same manner as that in the fixed-zoom mode.

P/T/Z Control

Another feature of the system 200 is the control mechanism. The P/T parameters of the existing systems depend upon either the detection results through the fixed zoom camera or detection results through the static camera, and never both. With the present invention, the optimization of the P/T/Z parameters takes the advantage of the information from both the static camera 220 and the PTZ camera 250, which makes the control more reliable and robust, as well as allows for an adjustable Z parameter, as has been mentioned previously.

Once a moving object is detected through the static camera 220, a tracker program will be activated for the detected object, both in the static camera 220 and a separate tracker for the PTZ camera 250. A conventional tracker program can be used, but one that works well is that described in U.S. Patent Application entitled “Method and Apparatus for Adaptive Mean Shift Tracking filed on the same day as this application, bearing attorney reference 042503/0317466, the contents of which are expressly incorporated by reference herein. At the same time, the PTZ camera 250 is commanded to turn to the direction of the moving target, acquire the moving target, and track it while taking high resolution images, as described hereinafter. In particular, the tracking through the PTZ camera 250 is active tracking—the PTZ camera 250 will follow the object actively based on the feed-back from the tracking algorithm. There are two motions in this scenario: the motion of the object and the motion of the PTZ camera 250. This composite motion poses a challenge for the tracking algorithm. But much detailed information can be obtained through the PTZ camera 250, which is helpful for tracking. In contrast, the tracking through the static camera 220 involves only one motion—the motion caused by the moving object. However, this single motion fact does not guarantee a better tracking result than that through the PTZ camera 250 because the tracking with the static camera 220 works on a small window with much less detailed information about the object.

Consequently, combining the feed-back from both of the two trackers (one for the static camera 220, the other for the PTZ camera 250) make the system 200 more reliable and robust Fuzzy logic has been used to fuse the feed-back as following: PTZ _(i) =αPTZ _(i) ^(s) +βPTZ _(i) ^(p)  (1) where PTZ_(i) is the final P/T/Z parameters, PZT_(i) ^(s) the P/T/Z parameters computed based on the information from the static camera 220, PZT_(i) ^(p) P/T/Z parameters from the PTZ camera 250. It is clear from the above formula that the final P/T/Z parameters are the weighted summation of the two sets of parameters from the two trackers. α and β are the fuzzy memberships for the two trackers, computed as follows: $\begin{matrix} {\alpha = \frac{C_{s} \cdot {P\left( V_{s} \middle| V_{s}^{p} \right)}}{{C_{s} \cdot {P\left( V_{s} \middle| V_{s}^{p} \right)}} + {C_{p} \cdot {P\left( V_{p} \middle| V_{p}^{p} \right)}}}} & (2) \\ {\beta \approx \frac{C_{s} \cdot {P\left( V_{s} \middle| V_{s}^{p} \right)}}{{C_{s} \cdot {P\left( V_{s} \middle| V_{s}^{p} \right)}} + {C_{p} \cdot {P\left( V_{p} \middle| V_{p}^{p} \right)}}}} & (3) \end{matrix}$ where C_(s) and C_(p) are the tracking outputs from static camera 220 and PTZ camera 250, V_(s) and V_(p) the velocities of the objects in static camera 220 and PTZ camera 250, and V_(s) and V_(p) the velocities in the static camera 220 and PTZ camera 250 in the previous frames. We model the velocities (V_(s) and V_(p)) using a Gaussian distribution, and V_(s) and V_(s) ^(p) is the probability of V_(s) in the current frame given the velocity in the previous frame is V_(s) ^(p). It is apparent that a α+β=1. The One-Board Solution

A further feature of the system 200 the single DSP board 270, which is the only board needed, in contrast to existing systems requiring up to three PCs. And some of the systems even need a PC running with a real-time operating system. If one uses a normal personal computer, then one must also use a conventional operating system like Windows, which do not give user the high level timing controls. For instance, the user's programs will be interrupted by the operating system's own functions like checking various ports, memory check up. As such, the present invention preferably uses a DSP, which allows for disabling all unnecessary interrupts and, as such, the user's program is in control. Therefore, automatically, it is a true real-time system) Such requirement relegate known dual camera rig systems to the laboratory environment, with exceedingly high cost. The single DSP board system provides a practical solution for large scale deployments.

Most of the computer vision algorithms are computational intensive. And there is no exception for the motion detection and tracking algorithms supporting the system 200. Instead of normal desktop PCs, we use TI DM642 EVM with the TMS320DM642 digital media processor on board. To fit all the algorithms to this single board, the optimization capability of the EVM has been pushed to the maximum level. This highest level of optimization is to preferably write all the software using DSP assembly. Thus, all the codes, implementing the functions shown in FIG. 2B, are loaded onto the DSP board.

Modifications and variations of the preferred embodiment will be readily apparent to those skilled in the art. For instance, various ones of the advantageous features described above can be used separately or in different combinations with other advantageous features. Further, although the present invention is described as a dual camera rig, the same algorithms can be used to implement a triple camera rig with one static camera and two PTZ cameras or an N-Camera rig with one static camera and N-1 PTZ cameras. Other such variations are within the scope of the present invention as defined by the claims. 

1. A method for taking a picture of an object comprising: providing in a substantially fixed positional relationship to each other a camera having a broad focal area and a PTZ camera having an adjustable-in-use focal length: detecting the object within a field of view of the camera; obtaining a size of the object and location of the object within the field of view of the camera; determining, using the size of the object and the location of the object, PTZ parameters that provide a desired location of the PTZ camera, the PTZ parameters including a focal length PTZ parameter that will allow for the PTZ camera to take a recognizable picture of the object; moving the PTZ camera to the desired location based on the PT parameters; and taking the picture of the object with the PTZ camera based on the determined focal length Z parameter.
 2. The method according to claim 1, wherein: the object moves, and the desired location thus moves, and the step of determining takes into account the movement of the object; and the steps of determining and moving are repeated in order to track the moving object.
 3. The method according to claim 2 wherein during certain ones of the repeated steps of determining, the step of determining uses information derived from image signals captured from the PTZ camera to assist in determining PTZ parameters and, wherein the step of determining uses fuzzy logic to fuse the information from the PTZ camera with information from the camera.
 4. The method according to claim 1 further including the step of automatic self-calibration of the position of the camera and the PTZ camera relative to each other.
 5. The method according to claim 1 wherein the automatic self-calibration uses a feature point tracking algorithm to obtain a correlation map between the static camera and the PTZ camera.
 6. The method according to claim 1 wherein the step of determining uses a look-up table as part of the controlling algorithm.
 7. The method according to claim 1 wherein the look-up table provides for a correlation map between size of the object in the camera and the adjustable-in-use focal length.
 8. The method according to claim 7 wherein the look-up table further provides correlation between a signal representative of a location of the object within the field of view of the camera and pan and tilt parameters.
 9. An apparatus for taking a picture of an object comprising: a camera having a broad focal area; a PTZ camera having an adjustable focal length, wherein the camera and the PTZ camera are mechanically mounted such that a fixed position between the camera and the PTZ camera remains substantially constant; a processor electrically connected to the camera and the PTZ camera, the processor: obtaining a size of the object and location of the object within the field of view of the camera; determining, using the size of the object and the location of the object, PTZ parameters that provide a desired location of the PTZ camera, the PTZ parameters including a focal length PTZ parameter that will allow for the PTZ camera to take a recognizable picture of the object; and outputting the PTZ parameters to the PTZ camera, wherein the PTZ camera is adapted to move to the desired location based on the PT parameters and takes the picture of the object using the determined focal length Z parameter.
 10. The apparatus according to claim 9 wherein, the processor determines the desired location using derived from image signals captured from the PTZ camera and fuzzy logic to fuse the information from the PTZ camera with information from the camera.
 11. The apparatus according to claim 9 wherein the processor further provides automatic self-calibration to adjust for small changes in the position between the camera and the zoom camera.
 12. The apparatus according to claim 11 wherein the automatic self-calibration performed by the processor includes using a feature point tracking algorithm to obtain a correlation map between the static camera and the PTZ camera.
 13. The apparatus according to claim 9 wherein the processor is housed on a single printed circuit board.
 14. The apparatus according to claim 9 wherein a housing of the zoom camera is mechanically mounted with another housing of the camera.
 15. The method according to claim 14 wherein the camera and the PTZ camera are mounted vertically on top of each other.
 16. The method according to claim 9 wherein the processor is a standalone single board digital signal processor.
 17. The apparatus according to claim 16 wherein the processor is housed on a single printed circuit board.
 18. The apparatus according to claim 9 wherein the computer repeats a sequence of inputting, determining and outputting and wherein the picture is retaken during each sequence. 